PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Araha.34278s0001.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis
Family CPP
Protein Properties Length: 675aa    MW: 73862.2 Da    PI: 5.1637
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Araha.34278s0001.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.63.9e-16373411240
                   TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                           ++k+CnCkkskClk+YCeCfaag++C e C+C dC+Nk 
  Araha.34278s0001.1.p 373 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCIDCFNKP 411
                           689**********************************96 PP

2TCR51.12.7e-16458496139
                   TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                           ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  Araha.34278s0001.1.p 458 RHKRGCNCKKSNCLKKYCECYQGGVGCSINCRCEGCKNA 496
                           589***********************************7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011142.3E-19372413IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163437.451373498IPR005172CRC domain
PfamPF036386.4E-12375410IPR005172CRC domain
SMARTSM011143.7E-17458499IPR033467Tesmin/TSO1-like CXC domain
PfamPF036381.1E-11460496IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 675 aa     Download sequence    Send to blast
MDTPQKSVTQ IGTPISKSRF EDSPVFNYIN SLSPIRPVRS IPNPNQFSSL NFTSPPSVFT  60
SPHLNSSHKE SRFFKTHNSS SSSSSSDPTN PVESREDEST SNEDVVAEGD DTKDLNIDAS  120
MREEETTRDD SVASPCGGDT TDLSLVPYAP LGENGSSENA GMELQKVYDN VQGKSETPDW  180
ESLISDASEL LIFDSPDASE AFRCFMMQRA SNSEARFSNG VEVQTMQPDS NRELESANAI  240
PYEAVSLLHR GIRRRCLDFE MPGNKQTLSE NNTATCESSS RCVVPSIGLH LNAILMSSKD  300
CKSNVSHDYS CSGKIQVGLQ SSISTLQETL DQTENETRED ADQDVPVEPA LQELNLSSPK  360
KKRVKLDSGE GESCKRCNCK KSKCLKLYCE CFAAGVYCIE PCSCIDCFNK PIHEDVVLAT  420
RKQIESRNPL AFAPKVIRNS ESVLETGDDA SKTPASARHK RGCNCKKSNC LKKYCECYQG  480
GVGCSINCRC EGCKNAFGRK DGSSIDMEAE QEEENETSEK SRTAKAQQNI EVLMRKEARS  540
DLPTTPTPIY RPELVQLPFS SSKNRMPPPQ SLLGGGSSSG IFNSQYLRKP DISLTQSRIE  600
KSSETVAEDG AEEMPEILIH SPIPNIKSVS PNGKRVSPPH MESSSSGSIL GRRSGGRKLI  660
LQSIPSFPSL TPQH*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1358365PKKKRVKL
2358366PKKKRVKLD
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00433DAPTransfer from AT4G14770Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAL1615390.0AL161539.2 Arabidopsis thaliana DNA chromosome 4, contig fragment No. 39.
GenBankCP0026870.0CP002687.1 Arabidopsis thaliana chromosome 4 sequence.
GenBankZ973370.0Z97337.2 Arabidopsis thaliana DNA chromosome 4, ESSA I FCA contig fragment No. 2.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002868254.10.0hypothetical protein ARALYDRAFT_493420
SwissprotF4JIF50.0TCX2_ARATH; Protein tesmin/TSO1-like CXC 2
TrEMBLD7MBC70.0D7MBC7_ARALL; Putative uncharacterized protein
STRINGfgenesh2_kg.7__2853__AT4G14770.10.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM14672891
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.10.0TESMIN/TSO1-like CXC 2